Missing exposure data in stereotype regression model: application to matched case-control study with disease subclassification.
نویسندگان
چکیده
With advances in modern medicine and clinical diagnosis, case-control data with characterization of finer subtypes of cases are often available. In matched case-control studies, missingness in exposure values often leads to deletion of entire stratum, and thus entails a significant loss in information. When subtypes of cases are treated as categorical outcomes, the data are further stratified and deletion of observations becomes even more expensive in terms of precision of the category-specific odds-ratio parameters, especially using the multinomial logit model. The stereotype regression model for categorical responses lies intermediate between the proportional odds and the multinomial or baseline category logit model. The use of this class of models has been limited as the structure of the model implies certain inferential challenges with nonidentifiability and nonlinearity in the parameters. We illustrate how to handle missing data in matched case-control studies with finer disease subclassification within the cases under a stereotype regression model. We present both Monte Carlo based full Bayesian approach and expectation/conditional maximization algorithm for the estimation of model parameters in the presence of a completely general missingness mechanism. We illustrate our methods by using data from an ongoing matched case-control study of colorectal cancer. Simulation results are presented under various missing data mechanisms and departures from modeling assumptions.
منابع مشابه
Comparison of the missing-indicator method and conditional logistic regression in 1:m matched case-control studies with missing exposure values.
The missing-indicator method and conditional logistic regression have been recommended as alternative approaches for data analysis in matched case-control studies with missing exposure values. The authors evaluated the performance of the two methods using Monte Carlo simulation. Data were generated from a 1:m matched design based on McNemar's 2 x 2 tables with four scenarios for missing values:...
متن کاملپیشبینی ابتلا به کمبود ویتامین D در سالمندان و ساکنان خانههای سالمندان شهر تبریز با استفاده از مدل رگرسیون استریوتایپ
Objectives: Vitamin D deficiency is one of the most important health problems of any society. It is more common in elderly even in those dwelling in rest homes. By now, several studies have been conducted on vitamin D deficiency using current statistical models. In this study, corresponding proportional odds and stereotype regression methods were used to identify threatening factors related to ...
متن کاملModelling Association Among Bivariate Exposures In Matched Case-Control Studies
The paper considers the problem of modelling association between two exposure variables in a matched case-control study, where both the exposures may be partially missing. The exposure variables could all be categorical or continuous or could be a mixed set of some categorical and some continuous variables. Association models for the missing exposure variables using the completely observed cova...
متن کاملمعرفی الگوریتم های مدل رده بندی درختی و کاربرد آن در تعیین عوامل مؤثر بر ابتلا به سرطان مری در استان گلستان
Background & objective: One of the common purposes of medical research is Determination of effective factors on the occurrence of event. Due to the interaction of risk factors regression models, discriminant analysis and classification procedures used. Uses of these models require making the assumption which in the medical data isn’t usually established. Therefore, alternative methods must be u...
متن کاملA blended model for estimating of missing precipitation data (Case study of Tehran - Mehrabad station)
Meteorological stations usually contain some missing data for different reasons.There are several traditional methods for completing data, among them bivariate and multivariate linear and non-linear correlation analysis, double mass curve, ratio and difference methods, moving average and probability density functions are commonly used. In this paper a blended model comprising the bivariate expo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Biometrics
دوره 67 2 شماره
صفحات -
تاریخ انتشار 2011